406 research outputs found

    Composite repetition-aware data structures

    Get PDF
    In highly repetitive strings, like collections of genomes from the same species, distinct measures of repetition all grow sublinearly in the length of the text, and indexes targeted to such strings typically depend only on one of these measures. We describe two data structures whose size depends on multiple measures of repetition at once, and that provide competitive tradeoffs between the time for counting and reporting all the exact occurrences of a pattern, and the space taken by the structure. The key component of our constructions is the run-length encoded BWT (RLBWT), which takes space proportional to the number of BWT runs: rather than augmenting RLBWT with suffix array samples, we combine it with data structures from LZ77 indexes, which take space proportional to the number of LZ77 factors, and with the compact directed acyclic word graph (CDAWG), which takes space proportional to the number of extensions of maximal repeats. The combination of CDAWG and RLBWT enables also a new representation of the suffix tree, whose size depends again on the number of extensions of maximal repeats, and that is powerful enough to support matching statistics and constant-space traversal.Comment: (the name of the third co-author was inadvertently omitted from previous version

    Forecasting species distributions : correlation does not equal causation

    Get PDF
    This research was funded by the U.S. Department of the Interior Northeast Climate Adaptation Science Center, which is managed by the U.S. Geological Survey National Climate Adaptation Science Center. Additional funding was provided by T-2- 3R grants for Nongame Species Monitoring and Management through the New Hampshire Fish and Game Department and E-1- 25 grants for Investigations and Population Recovery through the Vermont Fish and Wildlife Department.Aim Identifying the mechanisms influencing species' distributions is critical for accurate climate change forecasts. However, current approaches are limited by correlative models that cannot distinguish between direct and indirect effects. Location New Hampshire and Vermont, USA. Methods Using causal and correlational models and new theory on range limits, we compared current (2014?2019) and future (2080s) distributions of ecologically important mammalian carnivores and competitors along range limits in the northeastern US under two global climate models (GCMs) and a high-emission scenario (RCP8.5) of projected snow and forest biomass change. Results Our hypothesis that causal models of climate-mediated competition would result in different distribution predictions than correlational models, both in the current and future periods, was well-supported by our results; however, these patterns were prominent only for species pairs that exhibited strong interactions. The causal model predicted the current distribution of Canada lynx (Lynx canadensis) more accurately, likely because it incorporated the influence of competitive interactions mediated by snow with the closely related bobcat (Lynx rufus). Both modeling frameworks predicted an overall decline in lynx occurrence in the central high-elevation regions and increased occurrence in the northeastern region in the 2080s due to changes in land use that provided optimal habitat. However, these losses and gains were less substantial in the causal model due to the inclusion of an indirect buffering effect of snow on lynx. Main conclusions Our comparative analysis indicates that a causal framework, steeped in ecological theory, can be used to generate spatially explicit predictions of species distributions. This approach can be used to disentangle correlated predictors that have previously hampered understanding of range limits and species' response to climate change.Publisher PDFPeer reviewe

    Lightweight BWT and LCP merging via the gap algorithm

    Get PDF
    Recently, Holt and McMillan [Bioinformatics 2014, ACM-BCB 2014] have proposed a simple and elegant algorithm to merge the Burrows-Wheeler transforms of a collection of strings. In this paper we show that their algorithm can be improved so that, in addition to the BWTs, it also merges the Longest Common Prefix (LCP) arrays. Because of its small memory footprint this new algorithm can be used for the final merge of BWT and LCP arrays computed by a faster but memory intensive construction algorithm

    THE RESULTS OF PANCREATIC RESECTIONS AND LONG- TERM SURVIVAL FOR PANCREATIC DUCTAL ADENOCARCINOMA : A SINGLE-INSTITUTION EXPERIENCE

    Get PDF
    Objectives: Since the early 1990s, low long-term survival rates following pancreatic surgery for pancreatic ductal adenocarcinoma have challenged us to improve treatment. In this series, we aim to show improved survival from pancreatic ductal adenocarcinoma during the era of centralized pancreatic surgery. Methods: Analysis of all pancreatic resections performed at Helsinki University Hospital and survival of pancreatic ductal adenocarcinoma patients during 2000-2013 were included. Post-operative complications such as fistulas, reoperations, and mortality rates were recorded. Patient and tumor characteristics were compared with survival data. Results: Of the 853 patients undergoing pancreatic surgery, 581 (68%) were pancreaticoduodenectomies, 195 (21%) distal resections, 28 (3%) total pancreatectomies, and 49 (6%) other procedures. Mortality after pancreaticoduodenectomy was 2.1%. The clinically relevant B/C fistula rate was 7% after pancreaticoduodenectomy and 13% after distal resection, and the re-operation rate was 5%. The 5- and 10-year survival rates for pancreatic ductal adenocarcinoma were 22% and 14%; for T1-2, N0 and R0 tumors, the corresponding survival rates were 49% and 31%. Carbohydrate antigen 19-9 >75 kU/L, carcinoembryonic antigen >5 mu g/L, N1, lymph-node ratio >20%, R1, and lack of adjuvant therapy were independent risk factors for decreased survival. Conclusion: After centralization of pancreatic surgery in southern Finland, we have managed to enable pancreatic ductal adenocarcinoma patients to survive markedly longer than in the early 1990s. Based on a 1.7-million population in our clinic, mortality rates are equal to those of other high-volume centers and long-term survival rates for pancreatic ductal adenocarcinoma have now risen to some of the highest reported.Peer reviewe

    A Great Escape : resource availability and density-dependence shape population dynamics along trailing range edges

    Get PDF
    This research was funded by the Northeast Climate Adaptation Science Center, which is managed by the USGS National Climate Adaptation Science Center. Additional funding was provided by 1) a CFDA grant (15.678) administered by the USFWS via a Cooperative Agreement Award (no. F16AC00435) to the University of Massachusetts (UMass); 2) a Challenge Cost Share Agreement (no. 14-CS-11092200-019) between the USFS and NHFG; 3) a Dissertation Fieldwork Grant awarded to APKS by the UMass Graduate School, 4) generous support from backers of an Experiment award to APKS and MZ (DOI: 10.18258/10737) and 5) a National Science Foundation grant DEB-1907022 to LSM.Populations along geographical range limits are often exposed to unsuitable climate and low resource availability relative to core populations. As such, there has been a renewed focus on understanding the factors that determine range limits to better predict how species will respond to global change. Using recent theory on range limits and classical understanding of density dependence, we evaluated the influence of resource availability on the snowshoe hare Lepus americanus along its trailing range edge. We estimated variation in population density, habitat use, survival, and parasite loads to test the Great Escape Hypothesis (GEH), i.e. that density dependence determines, in part, a species' persistence along trailing edges. We found that variability in resource availability affected density and population fluctuations and led to trade-offs in survival for snowshoe hare populations in the northeastern USA. Hares living in resource-limited environments had lower and less variable population density, yet higher survival and lower parasitism compared to populations living in resource-rich environments. We suggest that density-dependent dynamics, elicited by resource availability, provide hares a unique survival advantage and partly explain persistence along their trailing edge. We hypothesize that this low-density escape from predation and parasitism occurs for other prey species along trailing edges, but the extent to which it occurs is likely conditional on the quality of matrix habitat. Our work indicates that biotic factors play an important role in shaping species' trailing edges and more detailed examination of non-climatic factors is warranted to better inform conservation and management decisions.Publisher PDFPeer reviewe

    Local climate determines vulnerability to camouflage mismatch in snowshoe hares

    Full text link
    AimPhenological mismatches, when life‐events become mistimed with optimal environmental conditions, have become increasingly common under climate change. Population‐level susceptibility to mismatches depends on how phenology and phenotypic plasticity vary across a species’ distributional range. Here, we quantify the environmental drivers of colour moult phenology, phenotypic plasticity, and the extent of phenological mismatch in seasonal camouflage to assess vulnerability to mismatch in a common North American mammal.LocationNorth America.Time period2010–2017.Major taxa studiedSnowshoe hare (Lepus americanus).MethodsWe used > 5,500 by‐catch photographs of snowshoe hares from 448 remote camera trap sites at three independent study areas. To quantify moult phenology and phenotypic plasticity, we used multinomial logistic regression models that incorporated geospatial and high‐resolution climate data. We estimated occurrence of camouflage mismatch between hares’ coat colour and the presence and absence of snow over 7 years of monitoring.ResultsSpatial and temporal variation in moult phenology depended on local climate conditions more so than on latitude. First, hares in colder, snowier areas moulted earlier in the fall and later in the spring. Next, hares exhibited phenotypic plasticity in moult phenology in response to annual variation in temperature and snow duration, especially in the spring. Finally, the occurrence of camouflage mismatch varied in space and time; white hares on dark, snowless background occurred primarily during low‐snow years in regions characterized by shallow, short‐lasting snowpack.Main conclusionsLong‐term climate and annual variation in snow and temperature determine coat colour moult phenology in snowshoe hares. In most areas, climate change leads to shorter snow seasons, but the occurrence of camouflage mismatch varies across the species’ range. Our results underscore the population‐specific susceptibility to climate change‐induced stressors and the necessity to understand this variation to prioritize the populations most vulnerable under global environmental change.Peer Reviewedhttps://deepblue.lib.umich.edu/bitstream/2027.42/154444/1/geb13049.pdfhttps://deepblue.lib.umich.edu/bitstream/2027.42/154444/2/geb13049_am.pdfhttps://deepblue.lib.umich.edu/bitstream/2027.42/154444/3/geb13049-sup-0001-Supinfo.pd

    Structural and Biomolecular Analyses of Borrelia burgdorferi BmpD Reveal a Substrate-Binding Protein of an ABC-Type Nucleoside Transporter Family

    Get PDF
    Borrelia burgdorferisensu lato, the causative agent of tick-borne Lyme borreliosis (LB), has a limited metabolic capacity and needs to acquire nutrients, such as amino acids, fatty acids, and nucleic acids, from the host environment. Using X-ray crystallography, liquid chromatography-mass spectrometry, microscale thermophoresis, and cellular localization studies, we show that basic membrane protein D (BmpD) is a periplasmic substrate-binding protein of an ABC transporter system binding to purine nucleosides. Nucleosides are essential for bacterial survival in the host organism, and these studies suggest a key role for BmpD in the purine salvage pathway of B. burgdorferi sensu lato Because B. burgdorferisensu lato lacks the enzymes required for de novo purine synthesis, BmpD may play a vital role in ensuring access to the purines needed to sustain an infection in the host. Furthermore, we show that, although human LB patients develop anti-BmpD antibodies, immunization of mice with BmpD does not confer protection against B. burgdorferi sensu lato infection.</p

    BENCHOP - The BENCHmarking project in Option Pricing

    Get PDF
    The aim of the BENCHOP project is to provide the finance community with a common suite of benchmark problems for option pricing. We provide a detailed description of the six benchmark problems together with methods to compute reference solutions. We have implemented fifteen different numerical methods for these problems, and compare their relative performance. All implementations are available on line and can be used for future development and comparison

    Inference of population splits and mixtures from genome-wide allele frequency data

    Full text link
    Many aspects of the historical relationships between populations in a species are reflected in genetic data. Inferring these relationships from genetic data, however, remains a challenging task. In this paper, we present a statistical model for inferring the patterns of population splits and mixtures in multiple populations. In this model, the sampled populations in a species are related to their common ancestor through a graph of ancestral populations. Using genome-wide allele frequency data and a Gaussian approximation to genetic drift, we infer the structure of this graph. We applied this method to a set of 55 human populations and a set of 82 dog breeds and wild canids. In both species, we show that a simple bifurcating tree does not fully describe the data; in contrast, we infer many migration events. While some of the migration events that we find have been detected previously, many have not. For example, in the human data we infer that Cambodians trace approximately 16% of their ancestry to a population ancestral to other extant East Asian populations. In the dog data, we infer that both the boxer and basenji trace a considerable fraction of their ancestry (9% and 25%, respectively) to wolves subsequent to domestication, and that East Asian toy breeds (the Shih Tzu and the Pekingese) result from admixture between modern toy breeds and "ancient" Asian breeds. Software implementing the model described here, called TreeMix, is available at http://treemix.googlecode.comComment: 28 pages, 6 figures in main text. Attached supplement is 22 pages, 15 figures. This is an updated version of the preprint available at http://precedings.nature.com/documents/6956/version/
    • 

    corecore